Robust linear classification from limited training data

نویسندگان

چکیده

We consider the problem of linear classification under general loss functions in limited-data setting. Overfitting is a common here. The standard approaches to prevent overfitting are dimensionality reduction and regularization. But loses information, while regularization requires user choose norm, or prior, distance metric. propose an algorithm called RoLin that needs no choice applies large class functions. combines “reliable” information from top principal components with robust optimization extract any useful “unreliable” subspaces. It also includes new cross-validation better than existing methods Experiments on 25 real-world datasets three show broadly outperforms both Dimensionality has $$14\%-40\%$$ worse test average as compared RoLin. Against $$L_1$$ $$L_2$$ regularization, can be up 3x for logistic 12x squared hinge loss. differences greatest small sample sizes, where achieves best 2x more competing method. For some datasets, 15 training samples norm-based 1500 samples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Noise-Robust Hidden Markov Models for Limited Training Data for Within-Species Bird Phrase Classification

Hidden Markov Models (HMMs) have been studied and used extensively in speech and birdsong recognition, but they are not robust to limited training data and noise. This paper presents two novel approaches to training continuous and discrete HMMs with extremely limited data. First, the algorithm learns the global Gaussian Mixture Models (GMMs) for all training phrases available. GMM parameters ar...

متن کامل

Constructing Robust Neural Decoders Using Limited Training Data

One of the essential components of a neuromotor prosthetic device is a neural decoder that translates the activity of a set of neurons into an estimate of the intended movement of the prosthetic limb. Wiener filter style approaches model this transformation as a linear function of the number of spikes observed from a set of neurons and over a range of distinct time bins. More recently, research...

متن کامل

Semantic Segmentation from Limited Training Data

We present our approach for robotic perception in cluttered scenes that led to winning the recent Amazon Robotics Challenge (ARC) 2017. Next to small objects with shiny and transparent surfaces, the biggest challenge of the 2017 competition was the introduction of unseen categories. In contrast to traditional approaches which require large collections of annotated data and many hours of trainin...

متن کامل

Classification of High Dimensional Data with Limited Training Samples

.. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv CHAPTER 1: INTRODUCTION ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .1 1.1 Background ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . ....

متن کامل

Deep Learning Methods for Classification with Limited Training Data

The human brain has an inherent ability to learn to react to something with just one past experience. The quest for Artificial Intelligence has brought us to the situation where machines simulating the abilities of the human brain are being developed. In this context, the a new flavour of the evergreen classification problem, that is, to classify data having seen few training instances becomes ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Machine Learning

سال: 2021

ISSN: ['0885-6125', '1573-0565']

DOI: https://doi.org/10.1007/s10994-021-06093-5